Cluster analysis and mathematical programming
نویسندگان
چکیده
Given a set of entities, Cluster Analysis aims at finding subsets, called clusters, which are homogeneous and/or well separated. As many types of clustering and criteria for homogeneity or separation are of interest, this is a vast field. A survey is given from a mathematical programming viewpoint. Steps of a clustering study, types of clustering and criteria are discussed. Then algorithms for hierarchical, partitioning, sequential, and additive clustering are studied. Emphasis is on solution methods, i.e., dynamic programming, graph theoretical algorithms, branch-and-bound, cutting planes, column generation and heuristics. Résumé Étant donné un ensemble d’objets, la classification automatique a pour but de trouver des sous-ensembles, ou classes, homogènes et/ou bien séparées. Comme de nombreux types de classification et critères d’homogénéité et de séparation sont dignes d’intéret, ce domaine est varié. On en présente une revue, d’un point de vue de programmation mathématique. On discute les étapes d’une étude de classification, les types de classigication et les critères. On étudie ensuite les algorithmes de classification hiérarchique, de partitionnement, de classification séquentielle et additive. On insiste sur les méthodes de résolution, c’est-à-dire la programmation dynamique, les algorithmes de graphes, les procédures d’optimisation par séparation, la génération de colonnes et les heuristiques. Acknoledgment: Corresponding author. Research supported by ONR grant N00014-95-1-0917, FCAR grant 95-ER-1048 and NSERC grants GP0105574 and GP0036426. State-of-the-art survey to be presented at the XVIth Mathematical Programming Symposium, Lausanne August 25–29 1997, to appear in Mathematical Programming, B.
منابع مشابه
A Mathematical Modeling for Plastic Analysis of Planar Frames by Linear Programming and Genetic Algorithm
In this paper, a mathematical modeling is developed for plastic analysis of planar frames. To this end, the researcher tried to design an optimization model in linear format in order to solve large scale samples. The computational result of CPU time requirement is shown for different samples to prove efficiency of this method for large scale models. The fundamental concept of this model is ob...
متن کاملAnalysis of LAI in Iran based on MODIS satellite data
This study was performed to evaluate the extent of leaf area in Iran from (2002) to (2016) using Remote sensing. For this purpose, we extracted data collection and leaf area index for the Iranian territory from MODIS website. The database was established with programming in MATLAB software to perform mathematical and Statistical calculations repeated. After the analysis of the data in this soft...
متن کاملMathematical solution of multilevel fractional programming problem with fuzzy goal programming approach
In this paper, we show a procedure for solving multilevel fractional programming problems in a large hierarchical decentralized organization using fuzzy goal programming approach. In the proposed method, the tolerance membership functions for the fuzzily described numerator and denominator part of the objective functions of all levels as well as the control vectors of the higher level decision ...
متن کاملRecent Advances in Mathematical Programming for Classification and Cluster Analysis
This chapter is focused on recent advances in mathematical programming methodologies in data mining research, which is a rapidly emerging interdisciplinary research area. The main focus of this review chapter lies on classification (supervised learning) and clustering (unsupervised learning), which are among the most studied data mining tasks. We give a thorough discussion on the mathematical m...
متن کاملA Mathematical Optimization Model for Solving Minimum Ordering Problem with Constraint Analysis and some Generalizations
In this paper, a mathematical method is proposed to formulate a generalized ordering problem. This model is formed as a linear optimization model in which some variables are binary. The constraints of the problem have been analyzed with the emphasis on the assessment of their importance in the formulation. On the one hand, these constraints enforce conditions on an arbitrary subgraph and then g...
متن کاملA multi-period fuzzy mathematical programming model for crude oil supply chain network design considering budget and equipment limitations
The major oil industry upstream activities include the exploration, drilling, extraction, pipelines installation, and production of crude oil. In this paper, we develop a mathematical model to plan for theseoperations as a crude oil supply chain network design problem.The proposed multi-period mixed integer linear programming model entails both strategic (e.g., facility location and allocation)...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Math. Program.
دوره 79 شماره
صفحات -
تاریخ انتشار 1997